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We propose a novel space-division based network-coding scheme for multiple-input multiple-output (MIMO) 
two-way relay channels (TWRCs), in which two multi-antenna users exchange information via a multi-antenna relay. 
In the proposed scheme, the overall signal space at the relay is divided into two subspaces. In one subspace, the 
spatial streams of the two users have nearly orthogonal directions, and are completely decoded at the relay. In the 
other subspace, the signal directions of the two users are nearly parallel, and linear functions of the spatial streams 
are computed at the relay, following the principle of physical-layer network coding (PNC). Based on the recovered 
messages and message-functions, the relay generates and forwards network-coded messages to the two users. We show 
■ that, at high signal-to-noise ratio (SNR), the proposed scheme achieves the asymptotic sum-rate capacity of MIMO 
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TWRCs within ilog(5/4) k, 0.161 bits per user-antenna for any antenna configuration and channel realization. We 
perform large-system analysis to derive the average sum-rate of the proposed scheme over Rayleigh-fading MIMO 
TWRCs. We show that the average asymptotic sum-rate gap to the capacity upper bound is at most 0.053 bits per 
relay-antenna. It is demonstrated that the proposed scheme significantly outperforms the existing schemes. 



I. Introduction 

During the past decade, tremendous progress has been made in the field of network coding HI. In [|2l|-|l4l, 
the concept of physical-layer network coding (PNC) was introduced and applied to wireless networks. The 
simplest model for wireless PNC is a two-way relay channel (TWRC), in which two users A and B exchange 
information via an intermediate relay. Compared with conventional schemes, PNC allows the relay to deliver 
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linear functions of the users' messages, which can potentially double the network throughput. It has been 
shown that the PNC scheme can achieve the capacity of a Gaussian TWRC within 1/2 bit per user ||5lll6l. 
and its gap to the capacity vanishes at high signal-to-noise ratio (SNR). 

Recently, efficient communications over MIMO TWRCs have attracted much research interest, where 
the two users and the relay are equipped with multiple antennas. Most work on MIMO TWRCs focuses 
on classical relaying strategies borrowed from one-way relay channels, such as amplify-and-forward (AF) 
[|8l- lfT0l . compress-and-forward [[TT | [[T2 l . and decode-and-forward (DF) [fT3l - lfT5l . These strategies generally 
perform well away from the channel capacity due to noise amplification and multiplexing loss ifTSl . Recently, 
several relaying schemes have been proposed to support PNC in MIMO TWRCs [fT6ll - lfT9l . The basic idea 
is to jointly decompose the channel matrices of the two users to create multiple scalar channels, over which 
multiple PNC streams are transmitted. Let ha, ub, and tir denote the numbers of antennas of user A, user B, 
and the relay, respectively. For configurations with ha^ub > ur, a generalized singular-value-decomposition 
(GSVD) scheme was shown to achieve the asymptotic capacity of MIMO TWRCs at high SNR [[T6l . For 
configurations with nA,nB < ur, all existing schemes may perform quite far away from the capacity. Such 
configurations, however, are of most practical importance. For example, due to the constrained physical sizes 
of the user terminals, it is usually convenient to implement more antennas at the relay station than at the 
user ends, as suggested in the standards of next generation networks ll20ll [ | 2T ]| . 

In this paper, we propose a new space-division based PNC scheme for MIMO TWRCs. Specifically, 
we first establish a novel joint channel decomposition, which characterizes the mutual orthogonality of the 
channel directions of the two users seen at the relay. Based on this decomposition, the overall signal space 
is divided into two orthogonal subspaces. In one subspace, the channel directions of one user are orthogonal 
(or close to orthogonal) to those of the other user. In this subspace, the spatial streams of the two users 
are completely decoded. In the other subspace, the channel directions of the two users are parallel or close 
to parallel. In this subspace, linear functions of the corresponding spatial streams are computed, without 
completely decoding the individual spatial streams. These linear functions of the spatial streams are referred 
to as network-coded messages. The messages and the network-coded messages, respectively generated from 
the two subspaces, are jointly encoded at the relay, and then forwarded to the two users. Afterwards, the 
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two users recover their desired messages. 

We derive the achievable rates of the proposed space-division based PNC scheme for MIMO TWRCs. 
We analytically show that, in the high SNR regime, the proposed scheme can achieve the sum capacity 
of the MIMO TWRC within min{n^, n^} log(5/4) bits, or ilog(5/4) ^ 0.161 bit/user-antenna, for any 
antenna setup and channel realization. This gap is much smaller than (as low as 10% of) the gap for the 
existing best scheme. We also perform large-system analysis to derive the average achievable sum-rate of the 
proposed scheme in Rayleigh fading MIMO TWRCs. We show that, in the high SNR regime, the average 
gap between our scheme and the sum-capacity upper bound is greatest when the antenna configuration is 
riA = riB = \nFi, with the gap being only 0.053 bit/relay-antenna. For all other configurations, the proposed 
scheme perform even closer to the capacity upper bound. Particularly, as the ratio nA/n^ (or UB/riji) tends 
to or 1, the gap to the capacity upper bound vanishes. All these analytical results agree well with the 
simulation. Numerical results demonstrate that the proposed scheme significantly outperforms the existing 
schemes in the literature across the full range of SNRs. 

II. Preliminaries 

A. Notation 

The following notations are used throughout this paper. We use lowercase regular letters for scalars, 
lowercase bold letters for vectors, and uppercase bold letters for matrices. The superscripts (■)^ and (-Y 
denote transpose and Hermitian transpose, respectively. || ■ || and || ■ \\f represent the Euclidian norm of a 
vector and the Frobenius norm of a matrix, respectively. C(X) represents the columnspace of a matrix X. 
j^nxm Qnxm (jg^ote the n-by-m dimensional real space and complex space, respectively. The operation 
log(-) denotes the logarithm with base 2, and | ■ | the determinant. is the indicator function with = 1 
for i = 1 and = for i 7^ 1; [■]+ represents max{-,0}; sign(x) represents the sign of x; Afc{fi,a'^) 
denotes the circularly symmetric complex Gaussian distribution with mean /x and variable o"^. 

B. System Model 

In this paper, we consider a discrete memoryless MIMO TWRC in which users A and B exchange 
information via a relay, as illustrated in Fig. [U User m is equipped with rim antennas, m E {A, B}, and 
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the relay with nji antennas. We assume that there is no direct link between the two users. The channel 
is assumed to be flat-fading and quasi-static, i.e., the channel coefficients remain unchanged during each 
round of information exchange. The channel matrix from user m to the relay is denoted by HmR € C"^^"™, 
and that from the relay to user m is denoted by Unm G C""^"^,??! G {A,B}. We further assume that 
the channel matrices are always of either full column rank or full row rank, whichever is smaller, and are 
globally known by both users as well as by the relay. 

The system operates in a half-duplex mode. Two time-slots are employed for each round of information 
exchange. Following the convention in [|T6ll - lfT8l . we assume that the two time-slots have same duration. The 
extension of our results to the case of unequal duration is straightforward. 

In the first time-slot (referred to as uplink phase), the two users transmit to the relay simultaneously and the 
relay remains silent. The transmit signal matrix at user m is denoted by E C"™^^, m E {A, B}, where T 
is the number of channel uses in one time-slot. Each column of denotes the signal vector transmitted by 
the Um antennas in one channel use. The average power at each user is constrained as [||Xm||^] < Pm, 
m G {A, B}. The received signal at the relay is denoted by Y/j G C"^^"^ with 

Yij = Har^A + ^BR^B + Zfl, (1) 

where Z/j G C^^^^ denotes the additive white Gaussian noise (AWGN) at the relay. We assume that 
the elements of Z/j are independent and identically drawn from ^4(0, A^o)- Upon receiving Yr, the relay 
generates a signal matrix Xr G C"^^^. 

In the second time-slot (referred to as downlink phase), X/j is broadcast to the two users. The average 
power at the relay is constrained as j;E [||X/j||^] < Pr. The signal matrix received by user m is denoted 
by Ym e C"'"^^,m G {A,B}, with 

Y^ = IiRmXR + Z^,me{A,B}, (2) 

where Z^, denotes the AWGN matrix at user m, with the elements independently drawn from A/'c(0, Nq). 

C. Definition of Achievable Rates 

For the system model described above, denote the message of user m by Wm G {1, 2, 2^^^™}. The 
cardinality of Wm is given by 2^-^^™, where the factor of 2T is because each round of information exchange 



consists of two length-T time-slots. At user A, the estimated message of user B, denoted by Wb, is obtained 
from the received signal and the perfect knowledge of the self message Wa- The decoding operation at 
user B is similar. The error probability is defined as Pg — Pr{iy4 7^ Wa or Wb 7^ Wb}- We say that a 
rate-pair (Ra, Rb) is achievable if the error probability Pg vanishes as T tends to infinity. The achievable 
rate-region is defined as the closure of all possible achievable rate-pairs. 

D. Capacity Upper Bound 

Here we briefly describe a capacity upper bound of the MIMO TWRC. Let — -fE [X^Xj^] , m E 
{A,B,R}, be the input covariance matrices. For given {Qa, Qb, Qr} satisfying tr(Qm) < Pm-,^ £ 
{A,B,R}, the achievable rate-pairs {Ra,Rb) of the MEMO TWRC is upper bounded as [[T6l 



Ra < min{P^^(QA),P5^(QR)} 
Rb < min{P^^(QB),Pl^^(Qi?)} 



(3a) 
(3b) 



where 
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(4b) 
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Here, the superscripts "UL" and "DL" respectively represent uplink and downlink, and the factor of 1/2 is 
due to the two time-slots used for each round of information exchange. 

A capacity-region outer bound is defined as the closure of the upper-bound rate-pairs in ([3]). This outer 
bound can be determined by optimizing Q^, Qb, and Q/j, as detailed in lfT6l and ifTTll . The goal of this 
paper is to develop a communication strategy that can approach this outer bound. 



in. Relaying Strategies for TWRCs with Single- Antenna Users 

In this section, we study efficient communications over TWRCs with single-antenna users and a multi- 
antenna relay, i.e., = = 1 and rij? > 1. The results developed in this section will be used in our 
studies on general MIMO TWRCs later. 



A. Relaying Strategies: Complete Decoding versus PNC 

For the case of single-antenna users, the channel model of the uplink phase in ([T]) can be simplified as 

Yr = hARX^ + hsR^l + Zn (5) 

where hmR E C"^^^ is the reduced version of Hm.R, and G C^^^ is the transmit signal vector of user m, 
with the zth entry of x^ being the signal transmitted at the ith time interval, m E {A,B}. Upon receiving 
Yr, the relay generates a network-coded message following the spirit of network coding. This network-coded 
message will be forwarded to the two users in the downlink phase. 

The relay operation is detailed as follows. In the uplink phase, the signal direction of user m is given by 
hmj?, m E {A,B}. On one hand, if Iiar and are orthogonal, both messages of the two users can be 
decoded free of interference from each other. The recovered messages of the two users are then network- 
coded and forwarded to the two users. We refer to this first strategy as the complete-decoding (CD) strategy. 
On the other hand, if h^iR and Iibb. turn out to be parallel (i.e., in a same direction), then it is advantageous 
to compute a linear function of x^ and x^, referred to as a network-coded message, instead of completely 
decoding both x^ and x^. We refer to this second strategy as the PNC strategy. 

In general, the following strategy can be adopted: if h^R and Iibr tend to be orthogonal, the complete- 
decoding strategy is applied; if Iiar and Iibr tend to be parallel, the PNC strategy is applied. The selection 
between these two strategies is based on their achievable rates, as described below. 

1 ) The Complete-Decoding Strategy: For complete-decoding, the uplink channel in (|5]) becomes a multiple- 
access (MAC) channel. Let R^.,m E {A,B}, be the rate of user m for the complete-decoding strategy. 
Then, the uplink rate-region of the complete-decoding strategy, denoted by 1Z'~'^ , is given by 
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(6a) 



RZ"" < \^og[l + ^hl^\irai^,mE{A,B}. (6b) 



which follows from the well-known MAC capacity region [|24B . 

2) The PNC Strategy: For the PNC strategy in |151|[6l, it is required that the two user-signals lie in a 
same spatial direction. This is not guaranteed here due to the availability of multiple antennas at the relay. 



To facilitate PNC, we next propose a projection-based method as follows. The signals from the two users 
are first projected to a common direction, denoted by a unit vector p G C"^^^. The choice of p will be 
discussed in the next subsection. This projection operation creates an aligned scalar channel given by 

P^Yr = p^hAR^l + p^hsR^l + p^Zr (7) 

with the effective channel coefficients given by p^h^R and p'^hsR. 

From [[T6l . if the sum of the two users' codewords, i.e., p^h^RX^^ + p^^h^/jx^, is required to be decoded, 
an achievable rate-pair (i?^^*" , R^^'^) of the uplink phase is given by 

1 



tdPNC 
Rm "2 



log 



,me{A,B}, (8) 



No 

where Qm = f^-E^fx^x^] represents the transmission power of user m. If p^Iiarx^ + p^h^/jxIJ is not 
necessarily decoded, the above rate-pair can be further improved by using the lattice-modulo operation and 
minimum mean-square error (MMSE) scaling ||5l ll22l . with the achievable rate-pair given by 
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Notice that ([8]) and ^ become identical and both approaches the uplink channel capacity at high SNR. 
The uplink rate-region of the PNC scheme is given by 

^PNC A {(R^^R^)\R^ < R^^^,Qm <Pm,me {A,B},p^p = 1} . (10) 
The boundary of TV^^ can be found by optimizing Qa, Qb, and p, as detailed in the next subsection. 



B. Optimization of Projection Direction 



Here we focus on the rate-pair given in ©ill As achievable rate-regions are convex, the boundary points 
of IZ^^'-^ can be determined by solving the weighted sum-rate maximization problem: 

maximize waRT^ + wbRI^^ (Ha) 
subject to I IpI I = < Pm,m e {y4, 5} (lib) 

where wa and wb are arbitrary nonnegative weighting coefficients. By inspecting ([8]), the maximum of (fTTl) 
is achieved at Qm = Pm, fn £ {A, B}. Thus, we only need to optimize p. 

'The treatment for the rate-pair in ^ is similar, and thus omitted. 



Suppose that R^^'^ = (or R^^'-^ = 0). Then, from ([8]), the optimal p is trivially taken as 
(or p^^)- Thus, we focus on the case of R'^^ > 0, m G {A,B}. In this case, this weighted sum-rate 
maximization problem is equivalent to maximizing 

max tij^log (^|p^hA/?f) + log (^|p"^hB/?|^j (12) 

or equivalently 

max h^^p h^^p 

llp||=i 

where p = [Re[p^], Im[p^]]^ and YimR = [R-sfh^^], Im[h^^]]^, m G {A,B}. By setting the derivative of 
the Lagrangian with respect to p to zero, the optimal p satisfies 

Wa^AR . Wsh-BR 



+ 



p^hAR P^hBR 

where a is a scaling factor. Then, with some straightforward algebra, we obtain the optimal projection 
direction given by 
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(14) 



and 7 is a scaling factor to ensure ||p°^*|| = 1. Particularly, for the sum-rate case, i.e., wa = u^b = 1, the 
optimal projection direction p is just the angular bisector of Iciar and Iibr if h^i^h^/j > 0, or the angular 
bisector of h^i? and — h^i? if h^^hBR < 0. By varying wa and wb, TZ^^^ can be determined. 



C. The Overall Scheme 

For the uplink phase, the achievable rate-regions 'Rp'^ and TZ^^^, for certain channel realizations of Iiar. 
and h.BR, are depicted in Fig. |2l The overall uplink rate-region, denoted by TZ^^, is given by the convex hull 
of TZ'^^ and TZ^^^ . In the overall scheme, the relay will select between the complete-decoding and PNC 
strategies for a larger achievable rate-region, according to ^ and (flOl) . 

For the downlink phase, the achievable rate-region is determined as follows. For the complete-decoding 
strategy, the relay jointly re-encode the decoded messages and x^, and forward the resulting codeword 
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to the two users in the downlink. For the PNC strategy, the relay forward the lattice-modulo of p^hy^^^x^ + 
p'^h.BR^^, referred to as a network-coded message, to the two users. Then each user recovers the message 
of the other user with the help of the perfect knowledge of self message. From [fT6l - [fT8l , the downlink 
rate-region 7^^^ for the two strategies are the same and given by 

7^^^ ^ {{Ra,Rb)\Ra < Ra\Rb < Rb""] (15) 

with 

RT = \ log (l + ^^RB^RB^ and ^ = \ log (l + ^^^a^^R^ ■ (16) 

Finally, an achievable rate-region of the overall scheme is the intersection of the uplink rate-region IZ^^ and 
the downlink rate-region 7^^^. 

IV. Space-Division Approach for MIMO TWRCs 

In the preceding section, we have studied the design of relaying strategies for TWRCs with single-antenna 
users. We have shown how to exploit the benefits of the complete-decoding and PNC strategies. In this 
section, we proceed to study the general case of > 1, ra^ > 1. We propose a new space-division based 
network-coding scheme, as a generalization for the case of single-antenna users. 

A. Motivations 

What motivates the proposed space-division approach is the following property of ^ar. and YIbr- Denote 
by C(H.ar) and C(H.br) the columnspaces of the uplink channel matrices ^ar and H^/j, respectively. In 
general, we can partition the columnspace C{H.ar) G C"^ as the direct surrP of three orthogonal subspaces: a 
subspace Sa\\b that is parallel to C^Hbr), i-e-, any vector in Sa\\b belongs to C(Hbr); a subspace Sa]^b that 
is neither parallel nor orthogonal to C(^br)\ and a subspace Sale that is orthogonal to C{YIbr)- Similarly, 
C{H.br) is the direct sum of three orthogonal subspaces Sb\\a-,Sb]I[A, and Sba_a- Note that Sa\\b = Sb\\a 
since both represent the common space of C{H.ar) and C^Hbr)- 

"Let 5 be a vector space, and let 5i, <S2, Sn be subspaces of S. S is defined to be a direct sum of <Si, 52, Sn when <Si, 52, Sn are 

n 

mutually orthogonal and for every vector x in 5, there is Xi in 5,, i — 1, 2, rz, such that x = 2, ^i- 
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In Sa\\b, the signal directions of the two users can be efficiently aligned to a same set of directions, 
providing a platform to carry out PNC, as in lfT6ll - lfT8l . On the other hand, in Sa±b or Sb±a, the two 
users do not interfere with each other, hence the complete-decoding strategy can be employed. The above 
treatments are similar to those for the case of single-antenna users, as discussed in the preceding section. 
What remains is the treatment for the signals in Sa!/ib and Sbi/^a that are neither parallel nor orthogonal. 
Heuristically, some channel directions in Sai/^b and Sb^a may be nearly parallel to each other. For these 
channel directions, the PNC strategy is preferable for the related spatial streams. On the other hand, some 
channel directions in Sai/^b and Sbii^a may be nearly orthogonal to each other. Then, the complete-decoding 
strategy is preferable. The main challenge lies in how to identify those nearly parallel/orthogonal channel 
directions. To this end, we next propose a new joint channel decomposition. 

B. Joint Channel Decomposition 

Let the compact singular value decomposition (SVD) of YlmR be 

= A„Vt„, m G {A, B} (17) 

where Um is an nn-hy-rim orthonormal matrix with Uj„Um = 

Denote by Aj the i\h eigenvalue of the matrix U^U^ + U^U^, and by the corresponding eigenvector. 
Without loss of generality, we arrange {Aj} in the descending order. As the eigenvalues of UmUj„ are either 
1 or 0, the eigenvalues {Aj} are valued between and 2. Note that A; = 2 implies that U^U^Uj= Uj and 
\]B^'^B^i~ ^i- This means that Uj is in the common space Sa\\b of H^^^ and H^ij. Also note that Aj = 1 
implies {UAU^Ui= u^, UBU^Ui= 0} or {\]A^\^i= 0, UBU^Ui= u^} (cf.. Theorem 4.3.4 in JlH). That 
is, VLi is in SAi_B (or which is orthogonal to the space spanned by ^br (or ^ar)- In addition, the 

number of eigenvalues between 1 and 2 is the same as that between and 1, as we will see later. 

Let k be the number of eigenvalues of U^U^ + UbU|j equal to 2; / be the number of eigenvalues between 
1 and 2; (Ia be the number of eigenvalues equal to 1 with {U^U^Uj= Uj, \]B^'^B^i= 0}; (Ib be the number 
of eigenvalues equal to 1 with {\J A^\vii= 0, UBU^Ui= u^}. Also let U G C^RMnA+ns-k) ^ matrix 
with the columns consisting of the eigenvectors corresponding to the ha + ub — k largest eigenvalues of 
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U^U^ + U^U^ (as specified in (|70l)). Due to the orthogonality of the eigenvectors, U is orthonormal, i.e., 

UtU = In^+ns-k. 

We are now ready to present the joint channel decomposition, with the proof given in Appendix A. 
Theorem 1: The channel matrices H^^^ and Hbr can be jointly decomposed as 

ilmR = UD^G^, m e {A, B} (18) 

where G C"'"^"'" is a square matrix, and G C*^"^+"^"^)^"'", m G {A, B} , are orthonormal matrices 
with a block-diagonal structure given by 
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(19b) 



eA;i 



2-Xi 
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and BB-i 



2-Xi 
2 



(19c) 



Remark 1: From (fTSi) . we see that C(U) is the overall columnspace of the two channel matrices, i.e., 
C(U) = C([H^ij Hbr]). Moreover, \JDm specifies the columnspace of H^, i.e., C(UDm) = C(Hm). Note 
that UDm is orthonormal, as Dj^jU^UDm = m G {A, 5}. Therefore, the columns of UD^ give an 
orthogonal basis of C(Hm/?), with the coordinates of HmR specified in Gm- 

Remark 2: The column structures of UD^ and UD^ are explained as follows. In the first place, we note 
that UD^ and UD^ share the same first k columns. Thus, the first k columns of UDa span Sa\\b, i-C-^ the 
common space of C(H^/j) and C(H5/j). Second, from (|19a| ). the last (Ia columns of UDa (obtained from 
multiplying U with the third block column of Da) are orthogonal to UD^. Hence, these columns of UD^i 
span the subspace Saj_b, i-e-^ the subspace orthogonal to C(H5/j). Third, the remaining / columns of UT)a 
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span the subspace by noting the facts that CCHar) = C(UDyi) and that CCHar) is the direct-sum of 

three orthogonal subspaces Sa\\b, ^Affs, and Sa±b- Similarly, the first k columns of \JDb span Sa\\b, the 
next / columns span Ssi/fA, and the last ds columns span Sbj_a- Recall that CCHar) is the direct sum of 
Sa\\b, Sa]/;b, and Sa±b, and that CCHbr) is the direct sum of Sb\\a, Sr^a, and Sb±a- Thus, the dimensions 
of these subspaces have the following relationship: 

k + l + (20) 

We summarize the geometrical meanings of the aforementioned subspaces and their dimensions as follows. 
Subspace Dimension Property 

Sa\\b k common space of C{H.ar) and C{H.br) 

Sa]\b I not parallel/orthogonal to C{H.br) 

Sb)^a I not parallel/orthogonal to CCHar) 

Sa±b dA orthogonal to CCRbr) 

'Sb±a dB orthogonal to C{H.ar) 

Let Vm-j be the ith column of UD^, rn E {A, B}. We refer to v^.j and v^.j as the ith channel direction 
pair. Here, v^.-v^.j = 1 means that v^.j and v^.j are parallel, and v^.^v^.j = means that they are 
orthogonal. Thus, v^.-v^.j can be regarded as a measure of the degree of orthogonality of v^.j and v^.j. In 
the following corollary, the degree of orthogonality of each channel direction pair (v^.j, v^.j) is determined 
by the magnitude of Aj, i.e., the ith eigenvalue of U^U^ + U^U^. 

Corollary 1: For i = 1, k + I, the degree of orthogonality of the ith channel direction pair (v^ j, j) 
is given by -VB^j = Aj - 1. 

Proof: For i = I, k, we see from (|19al) that Aj = 2 and v^.j = v^.j, and so v^.-v^.j = Aj — 1. For 
i = k + 1, k + I, from (|19a|) and (|19b|) . the ith column of UD^ is given by 



e^,i, m e {A,B}, (21) 



where represents the ith column of U. Then, we obtain v^.^^^-v^^fc+j = e\.,^^-eB;k+i = Aj — 1, where the 
first step utilizes the fact that U is orthonormal, and the second step follows from (I19cl) . ■ 



13 

Corollary 2: For i = k + l + 1, ua, v^.j is an eigenvector of UaU^ + U^U^ corresponding to Aj = 1, 
and is orthogonal to C{YIbr)\ for i = k + I + 1, ...,nB, ^B;i is an eigenvector corresponding to A; = 1, and 
is orthogonal to C{YIar)- 

Remark 3: The above corollaries show that the eigenvalue Aj is an indicator of the degree of orthogonality 
of the ith direction pair. In particular, Aj ~ 2 means that the two channel directions are close to parallel; 
and Aj ~ 1 means that the two channel directions are close to orthogonal. 

Remark 4: Before leaving this subsection, we emphasize that the joint channel decomposition in Theorem 
[His general for any sizes of Har and Hbr- Particularly, if Um > nji,m E {A, B}, then k = ur and / = 0, 
implying that all the eigenvalues {A^} are valued at 2. In this case, Har and Her span the same columnspace. 
Channel alignment techniques have been proposed in lfT6ll - lfT8l for efficient implementation of PNC. In what 
follows, we are mainly interested in the case of nA,nB < ur, i.e., there exist {Aj} valued between, but not 
including, 1 and 2. 

C. Space-Division Approach for MIMO Two-Way Relaying 

Based on the joint channel decomposition in Theorem [H we now propose a new space-division approach 
for MIMO two-way relaying. The main idea is to divide the overall signal space C([H^/j H^^]) = C(U) 
into two orthogonal subspaces: 1) iS^^*", in which the channel direction pairs (v^;i,VB;j) are parallel or 
close to parallel, for carrying out PNC; 2) S'-'^ for carrying out the complete-decoding strategy. Let T be 
an arbitrary integer between and /. Recall that the channel direction pairs are ordered by the degree of 
orthogonality as in Corollary \T\ Therefore, the first k + f direction pairs have lower degree of orthogonality 
compared to the remaining pairs. Thus, we allocate the first k + /'direction pairs to form a basis of 5^^*". 
The remaining channel directions give a basis of S^^. In this section, we assume that /'is given. The details 
on the optimization of /'will be discussed later in Sections V and VI. 

1 ) Space-Division Operation: Let the RQ decomposition of be 

G^ = R„.T|„, m e {A, B} (22) 
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where R™ G C 



is an upper-triangular matrix given by 



I'm ) ' I'm 



(23) 



and Tm E C"™^"™ is unitary. Together with (fTSi) . the channel matrices can be jointly decomposed as 

= UD„R„Tj„, m e {A, B} . 

Then, the received signal at the relay, after left-multiplying can be represented as 

= UtY^ = D^R^X:^ + BbKb^'b + Z^, 

where X;, = Tl^X^, m e {A, B}, and = V^Zn with i.i.d. elements ~ A/'c(0, A^o)- 
We partition R^ and as 



(24) 



(25) 



R»i 



Rm;l,l Rm;l,2 
Rm;2,2 



and T)r 



Dm;i,i 
Dm;2,2 



,m G {A,B} 



(26) 



where Rm;i,i G C(''+'^^(^+'^ and Rm;2,2 G C^'^'+^'^^^^'^'+p^^ m G {A, B}, are upper triangular matrices, and 
D^;i,i G C^'^+^OxCfe+O and D^.2,2 G C^^^-'^-^OxCi-r+p,^)^^ ^ {A,B}, are block-diagonal matrices. Then, 
(l25l) can be written as 



me{A,B} 



Dm;l,lRm;l,l ^ m,;l ,l^m;l ,2 








R 






+ 


Dm.2,2Rm;2,2 




vCD 







(27) 



where Y^, X^, and Z^ are correspondingly partitioned as 



y; 



R 



YPNc 


X' — 


v-PNC 

m 




. ^^^"^ . 




m 



, and Z^ 



2^PNC 

R 

Zg^ 



(28) 



Here, the superscript "PNC" (or "CD") represents the PNC (or complete-decoding) strategy. 

Based on the signal model in (l27l) . the proposed space-division based relaying strategy is described as 
follows. At user m, two groups of spatial streams are generated: one group, referred to as the complete- 
decoding spatial streams, form the codeword matrix X,^^; and the other group, referred to as the PNC 
spatial streams, form the codeword matrix X^^*^, m G {A, B}. 
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2) Complete-Decoding Spatial Streams: Due to the block triangular structure of the channel matrices in 
(|27l) . the relay can completely decode the spatial streams X^^ and X^^ free of interference from the PNC 
spatial streams. Specifically, the relay completely decodes both X^^ and X^^ based on 



~ Dm;2,2R.m;2,2X^'^ + Z^'^. (29) 

me{A,B} 

Then, X^^ and X^^ are canceled from the received signal in (l27l) . 

3) PNC Spatial Streams: After the cancelation of X^^ and X^^, the system model for the PNC spatial 
streams is given by 

^ ^^^^ 



YPNc 

R 



rPNC 
'R ■ 



En Vt -W-PNC I yi 

-Urn;!, + ^1 

m&{A,B} 

From (I19al) . the first k columns of Da;i,i and D^^i i are identical; however, for i = + 1, k + /', the 
ith columns of D^^i i and ^B]1,i are not. Following Section III, we project each column pair of Da;i,i and 
Db;1,i onto a common direction, so as to facilitate PNC. 

By inspection, the only difference between the ith columns of D^;i i and Db;i,i is given by the 2-by-l 
vectors eA;i and esji, for i = k + 1, ...,k + T. Without loss of generality, denote by pj a 2-by-l unit vector 
representing the projection direction of 6^;^ and bb-i- The choice of pj is similar to that described in Section 
III and will be detailed in the next section. 

Now the projection process can be described in a matrix form as follows. Define the projection matrix 

Ik ••• 

Pfc+1 ■•• : 

: ■•. ■•. 

••• pfc+r 
After the projection, the resulting signal model is given by 



(31) 



tPNC 
R 



T^PNC 



P Y 



R. 



mG{A,_B} 



PNCv-PNC I r^PNC 



+ z 



R 



(32) 



where H 



PNC 



P^D^;i,iR^;i,i = D„;i,iR„;i,i G C('=+'>('=+^ with 

D^;i,i = diag {l, l,Pfc+ie^;fc+i, ...,pl^iem;k+i] ,m G {A, B} 



(33) 
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and Zg^^ = P^Z|^^^ e C^"^"'^''^ with the entries being i.i.d. random variables ~ AC,(0, Nq). Note that the 
equivalent channel matrices H^^'^ and H^^'^ are {k + l)-hy-{k + 1) square matrices. For such an equivalent 
MIMO TWRC, efficient techniques can be employed to align the signal directions of the two user into a 
same set of A; + T directions. This provides a platform to carry out k PNC streams. 

So far, we have presented the signal processing techniques used in the proposed space-division scheme to 
manipulate the uplink channel. The encoding and decoding details of the overall scheme will be described 
in the next section. 

V. An Achievable Rate-Region of MIMO TWRC 

In this section, we derive an achievable rate-pair of the proposed space-division based PNC scheme. Based 
on that, we optimize the system parameters to determine the achievable rate-region. 

A. Achievable Rate-Pairs 

1) Complete-Decoding Spatial Streams: The equivalent channel model seen by the complete-decoding 
spatial streams is given in (|29l ). with the equivalent channel matrices given by T^m,2,2^m;2,2, ^ {^7 B}. 

The signal model in dlSl) is a standard MIMO MAC channel. Let Qg^ = [Xg^{Xg^y] be the input 
covariance matrix of the complete-decoding spatial steams of user m. Then, the achievable rate-pair of the 
complete-decoding spatial streams satisfies Il24l 

1 



No 

" me{A,B} 

I + -Tr'^m;2,2^m;2,2Qg^^ln;2,2^L;2,2 



(34a) 



,me{A,B}. (34b) 



2) PNC Spatial Streams: The equivalent channel seen by the PNC streams is given in (|32l ). Recall that 
H^^"" is a (A; + /)-by-(/c + /) square matrix, and the efficient design of PNC for this case has been discussed 
in |fT6ll - lfT8l . Here, we follow the GSVD-based approach in [fT6l , as briefly described below. 

Applying the generalized singular-value decomposition (GSVD) [|23l to H,^^'-^, we obtain 



H^^^ = B^^Tl, m e {A, B} , (35) 
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where B G Ci''+^')>^ (''+'■') is a nonsingular matrix, T„i G Ci^+^')>^ (''+'■') is an orthogonal matrix, m E {A,B}, 

is a diagonal matrix with the ith diagonal element denoted by am;i- We further take 
the QR decomposition to the matrix B, yielding 

H^^^ = QRS„Tl„, m G {A, B} , (36) 

where R G C'^'^'^'^^*^'^"*"'^ is an upper triangular matrix. The transmit signal X^^*" in (|32l ) is designed as 

Xr^ = T^</X^^,^e{A5}, (37) 

where = diag{ ^/^W, ■■■ , is a diagonal matrix with ^^.^ > 0,i = 1,2, k + 1', 

and Sm £ C*^^^'^^"^ is the signal matrix with each element independent and identically drawn from A/'c(0, 1). 

Let be the total rate of the PNC spatial streams of user m. From Theorem 1 in [[T6l . the achievable 

rate-pair is given by 



k+l' 

tdPNC ^ 



k+l' 



2 

1=1 



,me{A,B} (38) 



where is the indicator function with = 1 for i = 1 and = for i 7^ 1. 

3) The Overall Scheme: We now consider the overall achievable rate-pair of the proposed space-division 
based PNC scheme. Before going into details, we note that the power constraint of user m, i.e., [\\Xm\\l] < 
Pm, £ {^5 B}, can be equivalently expressed as 



k+l' 



tr{Q^^} + <Pm,me {A,B} . (39) 



We are now ready to present the following theorem on the achievable rates of the proposed scheme. 

Theorem 2: For given Q,^ , and Q/j satisfying (l39l) and tr(Q/?) < Pr, a rate-pair {Ra-, Rb) for the 
MIMO TWRC is achievable if 

R^ < min{i?^^ + C^^, C^}, m G {A, B} , (40) 

where R^^ and R%^ satisfy dS]), R^^^ is given by dMl), and R'^^ is given by ©. 

Proof: Here we provide a sketch of proof. The overall encoding and decoding process for the proposed 
scheme is described as follows. The messages of the user m are doubly indexed as {W^^ ,W^^'-^), with 
W^^ G {1, 2, 2^^-^™°} for the complete-decoding spatial streams, and W^^'-' G {1, 2, 2^^^™^^} for 
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the PNC spatial streams. Each W^^ is one-to-one mapped to in (1271 ). and each W^^'^ is one-to-one 
mapped to S^^^ in ([37]). In the uplink phase, Xg^ and X^^^ = T^^^^^S^^^ are transmitted via the 
channel in (1271 ). with the transmit power constrained by (|39l ). 

Upon receiving Yji, the relay first completely decode X^^ and X^^ based on Y^^ in (|29l ). with the 
achievable rate-pair given in (|34l ). The decoded X^^ and X^^ are subtracted from Y^^^. Let (s^^*")^ be 
the ith row of S^^*^. Then, the network-coded PNC spatial streams, i.e., f- jO'^.j^y.^s^^'^ + f- B i'^^m^B^'^ ' 
i = fc+r, fc+f— 1, 1, are successively recovered and canceled from Y^^^ in (|32l ), with the achievable rate- 
pair given in (|38] ). The decoded messages from the complete-decoding streams, together with the network- 
coded messages from the PNC streams, are then jointly encoded. The new codeword is forwarded to the two 
users in the downlink phase, with the transmit power constrained by tr(Q/j) < Pr. Following the discussions 
in [fT6l - [fT8l , the achievable rate-pair of the downlink phase is given by (-R^^, Rb^) in dH). This completes 
the proof. ■ 

B. Determining Achievable Rate-Region 

Now we consider determining the boundary of the achievable rate-region. From (|40l ). the downlink 
achievable rates are the same as the capacity upper bound in (U). Here, we focus on the uplink rate-region. 

The boundary of the uplink rate-region can be determined by solving the following weighted-sum-rate 
maximization problem: 



maximize 




m 




(41a) 



mG{A,B} 

fc+r 



subject to 



i^m; + tr{Q^^} < P^, h 0, > 0, for t = l,...,k + I'. 



(41b) 




(41d) 



(41c) 



< - log I + ^D^^2,2a^;2,2Qr R]n;2,2DL;2,2 , m G {A, 5} . 



(41e) 



The above problem involves the optimization of T, {pi} 



k+l' 



k+r 



as 



detailed below. 
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1) Determining the Projection Directions: The optimization of the projection directions {'Pi}^tk+i ^ 
maximize the weighted sum-rate is in general difficult to solve. To simplify the problem, we consider the 
high SNR regime, with the weighted sum-rate given by 



waRT"^ + WbRb 



PNC 1 ^ 



' k+l' 



(J 1 
2 



Wm log 

me{A,S} 



P 



(J 1 
2 



rne{A,B} 

id) 1 

2 



P 

m. 



m&{A,B} 



, j=2 



r"^ a"^ -lb ■ 

1,1 m\%Tm\i 

iVn 



"■L-'m;l,l-rv.m;l,l-rv.m;l,l-^m;l,l 



me{A,B} 



P. 



"R-mil.lR'm.;!,! 



+ Wm l0£ 



Dm;l,lDm;l,l 



(42) 



where step (a) follows from substituting (|381) . step (Z?) from the facts that R is upper-triangular and that 
equal power allocation is asymptotically optimal (i.e., T/'^.j = ^), step (c) by noting Dm;i^iRm;i^i = 
Yi^^ = QRS„T|„ (cf., and and step (d) by utilizing |I + AB| = |I + BA|. In the above. 



log 



Dm;l,lDm;l,l 



is the only term related to pj. Recall from (l33l) that Dm;i,i = P^Dm;i,i with Dm;i,i being 
the principle submatrix of in (l26l) . Thus, the weighted sum-rate maximization problem over {pi}^=fc+i 
can be decoupled into /'independent subproblems as 



max w^log 



+ wb log 



for i = A; + 1, /c + 



(43) 



where e^;j and eB^i are given in (|19cl ). From (fT3l) and the discussions therein, the optimal to maximize 
the weighted sum-rate is a real vector given by 



Pi 



li (eA;i + Pi^B^i) , for i = /c + 1, A; + l\ 



where 




D'l 1 



Wb 
wa 



+ 4^_(A,-l) 1--" 

WA V Wa , 



(44) 



(45) 



and 7j is a scaling factor to ensure ||pi|| = 1. 

2) Determining Q^^ and Q^^.' Given {pj} in (|44l) . the optimization problem in (|4T]) can be decoupled 
into two separate problems by predetermining the power allocated to the two signal subspaces. Let be 
the power of user m used for the complete-decoding spatial streams. Then, the power for the PNC streams 
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is given by P^^^ = Pm- ^m^, ^ ^ {A B}. For given P^^ and P^^, the optimal Q^^ and Qg^ to (gB 
can be found by solving the following problem: 



maximize waRa^ + waRa^ 

subject to tr{Q^^} < P^^, Q^^^ ^ 0, 



m,2,2 



m£{A,B} 



I + TrDm,2,2R'm;2,2Qm^R-Jn;2,2Dm^2,2 
iVn 



(46a) 
(46b) 

(46c) 
(46d) 



The above is a standard weighted sum-rate maximization problem for a MIMO multiple-access channel with 
two users [[30l . This problem is convex, and the optimal solution can be numerically obtained using convex 
programming tools ^251. 

f ~i k-\-r 

3) Determining Power Allocation for PNC Streams: Now we consider the optimization of [ipA;i\i-i ^^'^ 
{'ip B:i}'l-[- Given P^'^'^ and P^^'" , the optimal {^Aii}*-'^' ^^^^ be determined by solving 



maximize 



'k+r ^ 



mG{A,B} 

fc+r 



,i=l 



log 



subject to 5^ < P^^^, V^^., > 0, for 2 = 1, + r. 



(47a) 



(47b) 



A similar problem has been considered in [[T6l . and the optimal solution can be obtained by solving the 
Karush-Kuhn-Tuchker (KKT) conditions. We omit details here for simplicity. 

Base on the above discussions, the weighted sum-rate problem in (|4TI) is numerically solvable given the 
values of l\ P^^ and P^^'^, m G {A, B}. The optimal T, P^-^ and P^^^, m e {A, B} can be found using 
the exhaustive search. The complexity involved is not significant by noting P^^ + P^^'^ = P^, m G {A, B] 
and the fact that Tis an integer between and /. 



VI. Asymptotic Sum-Rate Analysis 

In the preceding section, we have shown the achievable rates of the proposed space-division based network- 
coding strategy for MIMO TWRCs. In general, it is difficult to represent the achievable rate of the optimized 
space-division based scheme in a closed-form. Thus, it is not easy to evaluate the gap between the achievable 
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rate of the proposed scheme and the capacity upper bound of the MIMO TWRC. In this section, we derive 
a closed-form expression for the asymptotic sum-rate of the proposed strategy in the high SNR regime. 



A. Asymptotic Sum-Rate as SNR — )■ oo 

Here, we analyze the uplink achievable sum-rate 

i?"^= E + (48) 

me{A,B} 

as the SNRs, i.e., and j^, tend to infinity. It is known that, in the high SNR regime, equal power 
allocation is asymptotically optimal. Then, the upper bound of the uplink sum-rate of the MIMO TWRC is 
given by (cf., ©) 

1 ^ . 

(49) 



R'^'^l E log + 1^H„H^ 



m£{A,B} 



NqU. 



where "x ^ means 



lim (x — y) = 0. 

SNR — ^oo 



Now, we present the following theorem on the asymptotic sum-rate of the proposed scheme. Denote by 
R^^ the uplink achievable sum-rate of the proposed space-division scheme. 

Theorem 3: For a given l", the uplink achievable sum-rate of the proposed space-division scheme satisfies 



where 



lim R^^ - R^^ = A^^ (50a) 

SNR — S.OO 



k+l' . k+l 

A^^^-log n y-log n VA.(2-A.)>0. (50b) 

i=k+l i=k+l'+l 



The proof of Theorem 3 can be found in Appendix B. Notice that the first term in (|50b| ). i.e., — log Y\ 



k+l' 

2 

i=k+l 

k+l 

is the rate loss incurred by the PNC spatial streams, and the second term, i.e., log Y\ A/Aj(2 — Aj), is 

i=k+i'+i 

that incurred by the complete-decoding spatial streams. 

Remark 5: For the case of nA.ns > n^, we have / = and Aj = 2 for i = l,...,k. (See Remark 
(HI).) Then, from (I50bl) . we have A"^^ = 0, which means that the scheme is asymptotically optimal. This 
agrees with the fact that our proposed space-division scheme reduces to the GSVD scheme which is indeed 
asymptotically optimal in the high SNR regime lfT6l . 
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Corollary 3: The optimal I' to minimize the rate gap A 



in (|50b| ) satisfies 



2 > Afe+i > ... > Afe+r > -z- > Afc+z'+i > ... > \k+i > 1- 



(51) 



With this choice of T, the asymptotic rate gap A'^-^ is at most /log(5/4) bits, which occurs when A^+i = 



Remark 6: From the above corollary, the asymptotic gap to the sum-capacity upper bound is /log(5/4) 
bits for the worst case. Noting / < rim, m G {A, B}, we see that the gap is at most min{nA, ns} log(5/4) 
bits, or ilog(5/4) ^ 0.161 bits per user-antenna. 

B. Average Sum-Rate via Large-System Analysis 

In this subsection, we investigate the statistical average of the rate gap A"^^ in fading channels. To this end, 
the distribution of {Aj}, i.e., the eigenvalues of UaU^ + U^U^, is required. However, such a distribution 
is difficult to obtain in general. Here, we employ the large-system analysis to find an approximation of the 
distribution of {A;}. The distribution obtained in this way becomes exact as the number of antennas in the 
system is large. 

We assume Rayleigh fading, in which the channel coefficients are i.i.d. circularly symmetric complex 
Gaussian random variables. Then, the matrices and in (flTI) are truncated uniformly distributed 
unitary matrices, or alternatively, are asymptotically free random matrices [,261 . Thus, we can use the theory 
of free probabilities to derive the asymptotic eigenvalue distribution (a.e.d.) of U^U^ + U^U^ as nji tends 
to infinity, with the result given in the lemma below. Define r],^ = G {A, B}. 

Lemma 1: As n/j — )• oo with — ?■ ?7a and ^ — t- r^^, the a.e.d. of U^U^ + UbU^ is given by 



where 5 {■) is a Dirac delta function and Im [■] is the imaginary part of a complex number. 

The proof of the above lemma can be found in Appendix C. As — )■ oo, we see that for r/^ + r/^ > 1, 
the portion of eigenvalues {Aj} equal to 2 is given by 77^ + 77^ — 1. This portion corresponds to the dimension 




[l-r,A-riBrH>^) + \VA-riB\H^-^) + VlA + riB-WH^-'^) 
1 \J{l-riA-riB?-{2X-X^) (l-(^^i^" 




(52) 
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of the common space Sa\\b of 'Rar and Hbr- In addition, for r/^ ^ i]^, the portion of eigenvalues {Aj} 
equal to 1 is given by \r]A ~ Vb\- This portion corresponds to the dimension of Sa±b if Va ^ Vb or the 
dimension of Sb±a if Va < Vb- 

We are now ready to present the following asymptotic result. 

Theorem 4: As — i- oo with — i]a and — — > 77^, the gap to the capacity upper bound satisfies 

Ft 

/\SD ( r\ /•2 x\ 

r^r,^ li^ ^_ /■ iog^A(2-A)+ / log- LF(A;77^,r/^)dA. (53) 

Proof: The a.e.d. of Aj is given by Lemma [TJ Then, letting rij^ tends to infinity in (|50b| ). we immediately 
obtain the theorem. ■ 
Let r'^^ be the average sum-capacity upper bound. Then, for a large ra/j, the average sum-rate of the 
proposed SD scheme can be first-order approximated as 

R = R - hrt^^ (54) 

with r^^ given in (l53l) . 

We next study the symmetric case that the two users are equipped with the same number of antennas, 
i.e., riA = riB = V- 

Corollary 4: For < 77 < j^. 



J.SD 



J log ^/\i2- \)gi\; ri)dX; (55a) 



for^<ri<l. 



^'logVA(2-A) + ^ \og^ jg{X;r^)dX, (55b) 



where X*{r]) = 1 + \/l - {1 - 2r]f and 



l^(2A-A^)-(l-2r^)^ 
TT 2A - A^ 



G{^;v) = -- 72 • (55c) 



Proof: Letting Va = Vb = we obtain that T (A; r/^, r]^) = Q (A; r^) for 1 < A < A*, and T (A; rj^, rj^) = 
for A* < A < 2. In addition. A* (77) = | implies rj = j^. Based on these facts and Theorem IH we obtain 
the corollary. ■ 
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Remark 7: From the above, we see that, if r/ < ^, the probability of Aj > | approaches zero as nji — )■ oo, 
implying that complete decoding achieves a higher rate than PNC for all spatial streams. 

Corollary 5: The asymptotic normalized rate gap r^^ in (1531) is maximized at = r^g = 1/2, with the 
maximum given by 

_ 1 / logVAlZ^,,, ^ f '"g^ A « 0.053 bit. (56) 

Proof: We first consider optimizing r/^ and 7]^ under the constraint of Va'^ Vb = '^V- From (|52l) . we 
see that, for any AG (1, 2), (A; r^^, r^^) is maximized at f]A = Vb = so is r^^. 

What remains is to optimize r]. From (|55cl) . ^ (A; r^) is maximized at r/ = 1/2. Therefore, r"^^ is maximized 
at 1] = 1/2, which completes the proof. ■ 

Fig. [3] illustrates the function of the normalized asymptotic rate gap r^^ against i]. From Fig. |3l this 
rate gap is maximized at r] = 1/2, which verifies Corollary |5l Also, this rate gap vanishes as 7] tends to 
0, implying that, for any fixed ua = ub, the proposed space-division scheme can achieve the asymptotic 
capacity as ur tends to infinity. Moreover, this rate gap vanishes as rj tends to 1. This agrees with the fact 
that, for > 1, or equivalently, ua = ris > ur, the proposed space-division based scheme reduces to the 
GSVD scheme in [fT6]l . 

VIL Numerical Results 

In this section, we provide numerical results to evaluate the performance of the proposed space-division 
based network-coding strategy for MIMO TWRCs. The results presented below are obtained by averaging 
over 1,0000 random channel realizations. Rayleigh-fading is assumed, i.e., the coefficients in the channel 
matrices are independently and identically drawn from Mc{0, 1). 

We first present the numerical results for a MIMO TWRC of = = 2 and riR = 4 in Fig. |4l The 
sum-capacity upper bound (UB), the proposed space-division (SD) scheme, the GSVD scheme in [[T6l and 
the complete-decoding scheme in ffTTI are included for comparison. We see that, at a relatively high SNR, 
e.g., SNR = 25 dB, the rate gap between the proposed SD scheme and the sum-capacity upper bound is about 
0.15 bit/channel-use, which is almost unnoticeable. We also plot the high-SNR analytical result in (|54|) of 
the proposed SD scheme. We observe that our analytical result are very tight for SNRs greater than 10 dB. 
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From this figure, it is clear that the proposed SD scheme significantly outperforms the other schemes in the 
entire SNR range of interest. For example, at the rate of 14 bits per channel use, the proposed SD scheme 
outperforms the complete-decoding and GSVD schemes by more than 2.4 dB. The slope of the achievable 
sum-rate curve is parallel to that of the capacity upper bound, which implies that the proposed SD scheme 
achieves full multiplexing gain. 

In Fig. [51 we present the numerical results for a MIMO TWRC of = ra^ = 2 and riji = 3. The same 
set of rate curves from simulation as in Fig. |4] are included for comparison. Again, we see that the gap 
between the sum-rate of the proposed SD scheme and the sum-capacity upper bound is almost unnoticeable 
at a relatively high SNR, e.g., greater than 15 dB. The proposed SD scheme outperforms its counterparts 
throughout the SNR range of interest. 

In Figures |6] and 111 we show the scaling effect of the antennas on the average achievable sum-rates. We see 
that the asymptotic rate gap between the proposed SD scheme and the sum-capacity upper bound increases 
linearly as the increase of ur for fixed r/^ and r]^. For example, for the case of = = 1/2 in Fig. |6l 
the rate gap at SNR = 25 dB is 0.14 bits per channel use for ur = 4; 0.29 bits per channel use for ur = 6; 
and 0.40 bits per channel use for ur = 8. These numerical results agree well with the asymptotic results in 
Corollaries |4] and |5l 

In Fig. [8l we show the achievable rate-region of the proposed SD scheme. The capacity-region outer 
bound and the achievable rate-region of the complete-decoding scheme are also included for comparison. 
From Fig. [8l the difference between the achievable rate-region of the proposed SD scheme and the capacity 
region outer bound is negligible for a relatively high SNR. We also see that the proposed SD scheme can 
achieve rate-pairs that cannot be achieved by the complete-decoding scheme. 

VIII. Conclusion 

In this paper, we developed a new joint channel decomposition for MIMO TWRCs. Based on that, we 
proposed a space-division based network-coding scheme with the achievable sum-rate within ilog(5/4) ^ 
0.161 bit per user- antenna of the capacity upper bound in the high SNR regime. We also show that, for 
Rayleigh-fading MIMO TWRCs, the average gap between the achievable rate of the proposed scheme and 
the capacity upper bound is no more than 0.053 bit per relay-antenna in the high SNR regime. We remark 
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that this marginal gap is due to the fact that the complete-decoding and PNC strategies, collectively, fail 
to achieve the asymptotic capacity upper bound, even for the case of single-antenna users. To completely 
remove this gap, more advanced multi-dimension PNC relaying strategies may be required. Moreover, in 
this paper, channel state information is assumed to be globally known by both the transmitter and receiver 
sides. It is of theoretical, and more practical, interests to investigate how to efficiently communicate over 
MIMO TWRCs where only the receiver-side channel state information is available. We will look into these 
problems in our future research. 

Appendix A 
Proof of Theorem [H 

Here we prove Theorem [IJ Let Aj be an eigenvalue of UaU^ + UsU^ and Uj be the corresponding 
unit-length eigenvector satisfying 

U^U^ + VbV^b) = AiUi. (57) 

We are interested in four cases of A,;: (a) Aj = 2; (b) 1 < Aj < 2; (c) Aj = 1; and (d) < Aj < 1. 
For case (a), Aj = 2 implies that 

UAU^Uj = Ui and UijU^Ui = Uj. 

Thus, Uj lies in the common space of C(\Ja) and C(Ub). 
For case (c), we have 

U^U^Ui = Ui and U^U^Ui = (58a) 

or 

U^U^Ui = and UbU^u^ = u^. (58b) 
We next show that the eigenvalues in case (b) and case (d) appear in a pair-wise manner. Denote 

= (U]„U„)"' Ut^U, = VmUlui. (59) 



Note that l^ji is the projection of vector Uj onto the column space of Um- From (|57l) . we obtain 

U. = ^ (U;i + \B;i) ■ (60) 

A,; 



27 



The above implies that u^, and l^^j lie on the same two-dimension plane (denoted by Si). We have the 
following facts. 

Lemma 2: For any Aj in case (b), the corresponding Uj is the angular bisector of \A;i and I^;,, i.e. 

||U;i||^ = u|U;i = ujle-i = \\lB;if ■ (61) 

Proof: To prove the lemma, we first multiply both sides of (|57] ) by U^U^. Then, after some straight- 
forward manipulations, we obtain 

(A, - 1) U^U^u, = VaU^VbUW (62) 



Similarly, we have 



Then, 



(A, - 1) UbUU* = VbV^VaV^u,. (63) 



Ai — i 

('^^ tjT Tit TT Tit ('^^ tri Tit tl 

Aj — i 

where step (a) follows from (|59| ), (6) from (|62|) . (c) from the fact that the Hermitian transpose of a real- 
valued scalar is itself, (d) from (|63l) . and (e) again from (|59| ). From (|59| ). the projection of Uj onto l^.j is 

1 1 2 t 

just Thus, ||lm;j|| = ujlm-j, wWch completes the proof. ■ 
Lemma 3: For any Aj G (1, 2) (as in case (b)), A^ = 2 — Aj is also an eigenvalue of UaU^ + UbU^j, and 
the corresponding unit-length eigenvector is given by 

u: = (U, - b,) . (64) 

V K\ 

Proof: By definition, we have 

= (u^U^u, + (A, - 1) UbUU. - (A. - 1) U^U^u, - UbU1,u,) 

= -|= (u, - li.,) = a:u: (65) 

V AjAj 
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where step (a) follows from (|64l ). and step (b) from (|59l) . (I62l) and (|63l ). 

What remains is to show that ||u.|| = 1. To see this, we left- multiply both sides of (l57l) by u], yielding 

1 1 2 1 1 2 

||U;i|l + PB;i|l = Ai- (66) 

Together with (|6TI) . we obtain 

||U;i||^ = = y- (67) 

Moreover, left multiplying (l62l) and (1631) respectively by uj and plugging in (|59l ). we obtain 

1^1^;^ = (Ai - 1) llWf = (Ai - 1) \\lB;if = llMi- (68) 



Then 



A,;A 



A. 

- (^^ - - 1) llWf - (Ai - 1) 

(6) 



1 (69) 

where step (a) follows from (|67l) and (|681) , and step (6) from (|67] ) and the fact of A- = 2 — A,. This completes 
the proof. ■ 

Lemma 4: The subspace Si spanned by 1^;,; and l^^j is orthogonal to Sj, for any j ^ i. 

Proof: From (|60l) and (|64|) . we see that both and u- lie on the plane Si. As Uj and are orthogonal to 
each other. Si is spanned by Uj and u^. Then, the lemma holds straightforwardly by noting the orthogonality 
between the eigenvectors. ■ 

Now we give an overall picture of the eigenvalues and eigenvectors of U^U^ + U^Ulj. Denote the 
k eigenvalues in case (a) by Ai,--- , A^, and the corresponding orthogonal eigenvectors by ui,--- , u^. 
Also denote the / eigenvalues in case (b) by A^+i, ■ ■ ■ , Xk+i in the descending order, and the corresponding 
eigenvectors by u^+i, ■ ■ ■ , u^^^. As the eigenvalues in (b) and (d) appears in a pair- wise manner, we further 
denote the / eigenvalues in case (d) by X'f^_^_i, ■ ■ ■ , A';.^; in the descending order, and the corresponding 



29 



eigenvectors by u^_,_^, ■ ■ ■ , u'^+j. Moreover, we denote the orthogonal eigenvectors in case (c.l) by 
Uk+i+i, ■■■ , Uk+i+dA, and the ds orthogonal eigenvectors in case (c.2) by Uk+i+d^+i, ■ ■ ■ , Uk+i+d^+dB- Let 



It can be readily verified that U is an orthonormal matrix satisfying U^U = In^+ns-fc- Define 



(70) 



U' 



UI5 ■ ■ ■ 5 Ufc, — -, ■ ■ ■ , 



II 



A;k+1\ 



A;k+l\ 



and 



B 



I 



Ul, ■ ■ ■ , Ufe, 



_B:fc+l 



1 



|lB:fc-l-1 II ' ' l|lB:i 



(71a) 



(71b) 



In the above, ui, ■ ■ ■ , Ujt are the eigenvectors in case (a); Ufc+^+i, u^+^+rf^ are the eigenvectors in case (c) 
satisfying UAU^Ui = Ui, for i = k + I + I, k + l + dA, Uk+i+dA+i^ " " " > Ufe+i+dA+ds are the eigenvectors 
in case (c) satisfying UbU^Uj = Uj, for z = /c + / + + 1, k + I + dA + ds- 
Then, from Lemmas 3 and 4, it can be verified that in (|19a|) satisfies 



U:„ = UD„,mGK5} 



(72) 



From Lemma 3 and the fact that G 5j for z = + 1, ■ ■ ■ ,k + l, the columns of are orthonormal. 
Together with the fact that all columns of lie in the columnspace of Um (and so in the columnspace of 
Jim), we see that and U'^ share the same columnspace. Thus, there exists an n^-by-nm square matrix 
Gm such that 

Hm_R = U'^Gm- (73) 



Combining (1721 ) and (1731 ). we obtain 



H 



mR 



(74) 



which completes the proof of Theorem [TJ 



Appendix B 
Proof of Theorem [3] 



30 



We first consider the sum-rate upper bound: 



R 



UL 



(a) 1 



m£{A,B} 



(J 1 

2 



me{A,B} 
~ 9 2^ log 



In™, "t" 



P 

± IT 



-UD^R^R|„D|„U^ 



p 



R RT 



(75) 



me{A,B} 

where step (a) follows by substituting (|24l) into (|49l) . step (Z?) follows from the facts that Dj,jU^UDm= 
and |I + AB| = |I + BA|, and step (c) utilizes the fact that is a square matrix. 

Now we consider the achievable sum-rate of the proposed space-division scheme. For notational simplicity, 
let = Dm;2,2^m;2,2, ^ {^,B}. From (|34l ). the sum-rate of the complete-decoding spatial streams 

can be expressed as 



RT + R'b'' hog 



I 



E 



m£{A,B} 



Norir, 



(6) 1 , 

= - loe 
2 ^ 



1 + 



P4 



me{A,B} 



(c) 1 , 

~ - log 



Pa 



(d) 



NqUa 

E 



-R/l:2.2Rt 



^A;2,2-CVyl;2,2 



1, 



-R-m;2,2R-m;2,2 



NqUb 



:Hg^)t ( I 



0^2,2 ( I + 



P 



H 



CD 



NqUa 



D 



B;2,2 



m<^{A,B} 

where step (a) utilizes the fact that equal power allocation is asymptotically optimal, and step (d) follows 
by substituting H^'^ = Db;2,2Rb;2,2- Applying the matrix inversion lemma to (\ + -^^^HJ^^(H^^)^^ , 
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we further obtain 



dCD I tdCD 



m£{A,B} 



Pr 



NqU. 



P 



B;2,2 



(a) 



(b) 



m,e{A,_B} 



p 



me{A,B} 



iVon, 
P 

J. 7-)-) 



"R'm;2,2R'm;2,2 



■R'm;2,2R'm;2,2 



1 , 



I ~ 2D^;2,2DV2 2Db;2,2 



fe+i 



(76) 



i=fc+r+i 



where step (a) follows by noting I + ^(H2^)tHS^ ^ ^(H50)tH2^ and H^^ = D^;2,2Ra;2,2, 
m G {A, 5}, and step (Z?) utilizes the definitions in (|19al) and (l26l) . Moreover, letting = wb = 1 in (|42|) . 
we obtain the sum-rate of the PNC spatial streams as 

p 



tdPNC , oPAfC 



i E 



+ log 



D™;i,iD|,;i,i|) . (77) 



me{A,_B} 

From (l44l) . is the angular bisection of e^.^ and Bba, or equivalently, pj = [1, 0]^, for the sum-rate case 
of wa = Wb = I- Then, using the definition in (|33] ). we obtain 



log 



Dm;l,lDm:l 



k+l' 



(78) 



=fe+l 



Combining (TTSII -dTSl). we complete the proof of Theorem 3. 



Appendix C 
Proof of Lemma □ 

We prove by using the theory of free probability ETl . The a.e.d. of UmUj„ is given by 

Pm (A) = (A - 1) + (1 - riJ S{\),me {A, B} . 
Let Xjn be a random variable with PDF pm (A). Its Stieltjes transform is given by (cf., (2.40) in [26J) 



Sx^ (z) = E 



1 



Then, the inverse function of Sx^ {z) is given by 

Sxl is) 



l-z 



- (1 - s) ± V ( 1-3)^-4. 

2s 
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Using the relation between Stieltjes transform and R-transform (cf., (2.72) in [|26ll ). we obtain the R-transform 

of Xm as 



1 Z-'^T\iZ-l) +4:7J^Z 



' z 2z 

From Theorem 2.64 of [26], as U^iUl^ and U^U^ are asymptotically free random matrices, the R-transform 
of the a.e.d. of U^U|4 + UsU^ is given by 



Rab {z) = Rx^ (z) + Rxs (z) = Yl 

me{A,B} 

Then, the Stieltjes transform of the a.e.d. of UaU^ + UsU^ satisfies 



z-lT \ [z-l) +4:r]^z 



2z 



■me{A,B} 

Letting y = S^b (—z) , we obtain 



{z - if + Ar]^z 



2z 



J2 \/{z-lf + 4v^z = T2z{y-l). 

■me{A,B} 



Multiplying -v/ (2 — 1)^ + 4:ri^z — y {z — if + '^VbZ on both sides, we have 



'z - if + Ar^^z -J{z-lf + Ari^z = T 



2(^A - Vb) 



y-i 



Adding the above two equations and taking the square, we further obtain 



{z-lf + Ar,AZ={z{y-l) + ^-^-^ 



Solving z, we obtain 



Sab (z) 



l~VA-VBT^{l-VA-VBy + i2z-z^)[{^-^f-l 

2z - z^ 



From (2.45) in [,26,1 . the a.e.d. of U^U]^ + UsU^ is given by 



J^(A) = lim -lm[SABiX+M]. 



Thus, for < A < 1 and 1 < A < 2, we obtain 



J^(A) = -Im 

TT 



l-VA'VBy+{2X-X')[m^)'-l 
2A^A^ 



(79) 
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In addition, for a randomly generated pair of \Ja and U^, there are ua + ub — ur orthogonal eigenvectors 
for Aj = 2, \nA — ubI orthogonal eigenvectors for = 1, and ur — ua — ub orthogonal eigenvectors for 
Aj — 0. Thus, as ur tends to infinity, the PDF T{X) at A = 2 is given by [tja + tjb — i]'^ S {X — 2); that at 
A = 1 is given by \rjA — VbI S {X — 1); and that at A = is given by [I — tja — Vb]'^ ^ W- This concludes 
the proof of the lemma. 
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Fig. 1. Configuration of a MIMO TWRC. 
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Fig. 2. The uplink rate-regions of the TWRCs with single-antenna users, = [1,0]"^ and hs = [cos sin 0]"^. Channel SNR = 1/A'o = 10 
dB. The horizontal axises represent the rate of user A; the vertical axises represent the rate of user B; the unit is bit per channel use. 




Fig. 4. Average achievable sum-rates of various schemes for the Rayleigh fading MIMO TWRC with ua = ub = 2 and hr — 4. 
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Fig. 5. Average achievable sum-rates of various scliemes for the Rayleigh fading MIMO TWRC with ua = ub = 2 and hr = 3. 
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Fig. 6. Scaling effect of the average sum-rates of various schemes for the Rayleigh fading MIMO TWRCs with rj^ = rjg = 1/2. 
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Fig. 7. Scaling effect of the average sum-rates of various schemes for the Rayleigh fading MIMO TWRCs with rj^ = rj^ = 2/3. 
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Fig. 8. Average achievable rate-regions for the Rayleigh fading MIMO TWRC with ua = ub = 2 and riR — 3. The average SNRs for all the 
channel links are set to 30 dB. 



